ANTIGEN-BINDING PROTEINS



FIELD OF THE INVENTION

The present invention relates to the preparation of multivalent
and multispecific binding proteins. In particular, the
invention relates to the preparation of antigen binding proteins
comprising a plurality of binding units linked in series by
means of intervening polypeptide linker groups, the amino acid
sequence of which linker groups confers restricted conformational
flexibility.

BACKGROUND OF THE INVENTION 

There is considerable interest in the preparation of multivalent
and/or multispecific antigen binding proteins. Antigen binding
proteins which are multivalent (that is, comprise more than one
antigen binding site), more especially those which are also
multispecific (where the antigen binding sites have differing
antigen specificities), have found particular application in the
fields of diagnosis or therapy, for example, where the
construction of binding proteins having binding activity against
both target site and diagnostic or therapeutic agent allows for
targeted delivery of the diagnostic or therapeutic agent to the
intended site of action. Other uses for which multivalent and
multispecific binding proteins have been proposed include
assays, such as immunoassays and agglutination assays, and
purification processes.

Those multivalent, multispecific antigen binding proteins which
have been described in the literature to date rely, in general,
on the association of antibody light and heavy chain variable
domains for the formation of the antigen binding site.

Thus, constructs comprising two or more polypeptide chains are
described in WO 94/09131 (Scotgen Limited), WO 97/14719
(Unilever) and WO 97/38102 (Unilever); multivalent molecules
comprising two or more single chain Fv molecules linked together
are described in WO 93/11161 (Enzon Inc.) and WO 94/13806 (Dow
Chemical Co.).

WO 00/24884 PCT/EP99/08323

WO 94/04678 (Casterman et al) describes immunoglobulins capable
of exhibiting the functional properties of classical, four
chain, immunoglobulins but which comprise two heavy polypeptide
chains only and are naturally devoid of light polypeptide
chains. Fragments corresponding to isolated VH domains or to VH
dimers linked by the hinge disulphide are also disclosed. These
immunoglobulins, which may be isolated from Camelids, do not
rely on the association of heavy and light chain variable
domains for the formation of the antigen-binding site; instead,
the heavy chain variable domain (hereinafter VHH) alone forms
the complete antigen binding site, constituting a single domain
binding site.

In their later patent application, WO 96/34103, Casterman et al
disclose multivalent, multispecific constructs comprising VHH
fragments combined with a linker sequence. Suitable linker
sequences disclosed and exemplified are derived from sequences
corresponding to the hinge domain of an immunoglobulin devoid of
light chains.

In the Applicant's co-pending patent publication number
WO 99/23221, published after the priority date of the present
application, there are disclosed multivalent, multispecific
antigen-binding proteins comprising a polypeptide comprising in
series two or more single domain binding units which are
preferably variable domains of a heavy chain derived from an
immunoglobulin naturally devoid of light chains. The individual
single domain binding units may suitably be linked by means of
peptide linkers, preferably flexible peptide linkers, which
allow the variable domains to flex in relation to each other
with the aim of ensuring that they can bind to multiple
antigenic determinants simultaneously.

There remains a continuing need for the development of improved
methods for producing multivalent and/or multispecific binding
proteins, especially antigen binding proteins. In particular,
there is commercial interest in producing molecules which not
only have improved binding activity but which also can be
produced economically on a large scale.

SUMMARY OF THE INVENTION 

In a first aspect, the invention provides the use of a
polypeptide group, the amino acid sequence of which group
confers restricted conformational flexibility, as a linking
group to link binding units in a multivalent binding protein.

The invention also provides a multivalent binding protein
comprising a plurality of binding units linked by means of
intervening polypeptide linker groups, the amino acid sequence
of which linker groups confers restricted conformational
flexibility.

The invention further provides a nucleotide sequence encoding a
multivalent antigen binding protein according to the invention
and cloning and expression vectors comprising such nucleotide
sequences. Also provided are host cells transformed with
vectors comprising such nucleotide sequences.

As used herein, a 'multivalent binding protein' is a protein
which has more than one binding unit, each of which allows for
specific binding with a molecule partner in a binding pair.
Included within this are bivalent, trivalent and so on. Examples
of suitable binding units include antigen binding domains of
antibodies, binding domains of receptors such as hormone
receptors, lectins, enzymes, and cell adhesion molecules. A
'multivalent antigen binding protein' is a protein which has
more than one antigen binding unit.

An 'antigen binding unit' is any structure which exhibits
antigen-binding activity. This may be an antibody or an
immunologically active fragment thereof. An 'antibody' refers
to an immunoglobulin which may be derived from natural sources
or synthetically produced. Unless indicated otherwise,
'antibody' and 'immunoglobulin' are used synonymously throughout
this specification.

An antibody fragment is a portion of a whole antibody which
retains the ability to exhibit antigen-binding activity. The
antigen binding site may be formed through association of
antibody light and heavy chain variable domains or may comprise
individual antibody variable domains, constituting a single
domain binding site.

Suitable fragments include Fab (comprising an antibody light
chain associated with the VH and CH1 domains of an antibody heavy
chain), Fv (comprising the variable domains of antibody heavy
and light chains associated with each other) and scFv
(comprising an antibody VH domain linked to a VL domain by a
flexible peptide linker) fragments. Where the antigen binding
site comprises a single variable domain, this may be a heavy
chain variable domain, most suitably a heavy chain variable
domain derived from an immunoglobulin naturally devoid of light
chains.

'Restricted conformational flexibility' relates to restriction
of movement of the antigen binding units about the backbone of
the intervening polypeptide linker group.

The present invention may be more fully understood with
reference to the following description, when read together with
the accompanying drawings. For convenience, an antigen binding
protein comprising two single binding units is hereinafter
referred to as a 'bi-head'.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the nucleotide sequence of the PstI-BstEII insert
         of plasmid pUR4640, encoding the heavy chain variable
         domain of an anti-RR6 antibody (denoted R9) from a
         llama.

Figure 2 shows the nucleotide sequence of the PstI-BstEII
         insert of plasmid pUR4601, encoding the heavy chain
         variable domain of an anti-hCG antibody (denoted H14)
         from a llama.

Figure 3 shows a map of plasmid pUR4619.

Figure 4 shows the nucleotide sequence within plasmid pUR4619
         which encodes an anti-hCG-anti-RR6 bispecific
         biheaded antigen-binding protein (denoted H14-R9),
         missing the first 4 and last 3 amino acids.

Figure 5 shows the A405 signals of an ELISA to determine
         bispecificity of various H14-R9 biheads.

Figure 6 shows the scores achieved in a rapid assay technology
         (RAT) format assay following the detection of 1 IU/ml
         hCG (human chorionic gonadotrophin protein) with
         various anti-hCG-anti-RR6 bihead antigen binding
         proteins derived from a llama wherein the anti-hCG
         and anti-RR6 fragments are linked as follows (see
         Example 1.5):

         Number   Linker
         1        no linker (directly attached)
         2        G-T-S-G-S              (SEQ. ID NO. 1)
         3        S-S-S-A-S-A-S-S-A      (SEQ. ID NO. 2)
         4        G-S-P-G-S-P-G          (SEQ. ID NO. 3)
         5        A-T-T-T-G-S-S-P-G-P-T  (SEQ. ID NO. 4)
         6        A-N-H-S-G-N-A-S        (SEQ. ID NO. 5)



Figure 7 shows a comparison of the sensitivity of detection of
         hCG in a RAT assay using various biheads (see Example
         1.5).

DETAILED DESCRIPTION OF THE INVENTION 

The invention is based on the unexpected finding that by using a
polypeptide linking group conferring restricted conformational
flexibility to link together antigen binding units, multivalent
antigen binding proteins having advantageous binding affinity,
as demonstrated by their increased sensitivity of diagnosis and
detection, are obtained. Furthermore, constructs according to
the invention may conveniently be produced at high yields
economically and efficiently on a scale appropriate for
industrial use.

As is apparent from the discussion of the background to the
invention above, to the extent that multivalent antigen binding
constructs comprising separate binding units linked together
have been described at all in the literature, the linking means
has been provided by flexible peptide groups. Flexibility of
conformation in the linker group has been considered desirable
in order to allow the multivalent construct to assume the
correct orientation to allow simultaneous binding of multiple
antigens.

Surprisingly, the present inventors have found that by
restricting the conformational flexibility of the linking
polypeptide group, multivalent antigen binding constructs having
improved binding affinity may be obtained. This is entirely
contrary to the teaching in the art that the linking group
should desirably be flexible.

The invention is applicable to the preparation of multivalent
antigen binding constructs comprising antigen binding units
where the antigen binding site is formed through association of
antibody light and heavy chain variable domains. Preferably,
however, the constructs prepared according to the invention
comprise a plurality of single domain binding units, more
particularly a plurality of heavy chain variable domains derived
from an immunoglobulin naturally devoid of light chains such as
may be obtained from lymphoid cells, especially peripheral blood
lymphocytes, bone marrow cells or spleen cells derived from
Camelids as described in WO 94/04678 (Casterman et al) discussed
above. An advantage of using single domain binding units which
are heavy chain variable domains derived from Camelids is that
they can readily and conveniently be produced economically on a
large scale, for example, using a transformed lower eukaryotic
host, as described in WO 94/25591 (Unilever).

It will be appreciated that heavy chain variable domains derived
from other immunoglobulins modified ('camelised') to enable them
to function as single binding domains in the same way as the
heavy chain variable domains derived from Camelids may also
suitably be used according to the invention.

Bivalent forms, that is having two antigen binding sites, of the
multivalent antigen binding proteins prepared according to the
invention are preferred but it will be appreciated that higher
multivalent forms, which are also encompassed in the present
invention, may find application under suitable circumstances,
for example where more than two antigens are required to bind,
as in processes for scavenging molecules from solution
or processes where close proximity of molecules forms the basis
of an assay.

Structural features which may suitably be incorporated into the
linking polypeptide group in order to achieve the effect of
restricting conformational flexibility according to the purposes
of the invention would readily suggest themselves to those
skilled in the art.

Accordingly, in one embodiment, the linker group preferably 
comprises one or more proline residues. 

Without wishing to be bound by theory, it is generally thought
that the presence of a proline residue in a peptide sequence
encourages the amino acid backbone of the peptide to adopt a
beta-turn structural configuration, with the peptide backbone
changing direction about the proline residue. Linker groups
comprising other sequence features which promote the formation
of a beta-turn configuration in the peptide backbone, such as
peptide linkers containing valine residues or constrained
residues such as 8-bicyclic and 5,9-bicyclic tripeptide units
(see, for example, Johannesson et al, J. Med. Chem., 42, 601-608
(1999)), may also suitably find application in the present
invention.

In another embodiment, peptide linker groups derived from
naturally occurring proteins such as cell wall proteins (CWP),
in particular, CWP1, or cellobiohydrolases (CBH), such as CBH1P,
which serve to restrict conformational flexibility, or linker
groups showing at least 50% homology thereto as determined by
the ALIGN program of Dayhoff et al (1983), Methods Enzymol., 91,
524-545, may also suitably be used according to the invention.

Peptide linker groups which encode a glycosylation binding site
and/or are resistant to proteolytic attack may also suitably be
employed. Here, the presence of a carbohydrate attached to the
amino acid residues has the effect of restricting the
flexibility of the peptide backbone.

Conveniently, the polypeptide linking group according to the
invention comprises from 4 to 30 amino acid residues, preferably
from 5 to 15 amino acid residues.

Preferred polypeptide linking groups according to the invention
comprise an amino acid sequence selected from:

S-S-S-A-S-A-S-S-A      (SEQ. ID NO. 2)

G-S-P-G-S-P-G          (SEQ. ID NO. 3)

A-T-T-T-G-S-S-P-G-P-T  (SEQ. ID NO. 4)
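The length ranges and the proline preference stated above can be checked mechanically for any candidate linker. The following is a minimal sketch, assuming linkers are written as one-letter amino acid strings; the function name `describe_linker` and the field names it returns are hypothetical and serve only as illustration.

```python
# Illustrative check of a linker against the ranges given in the text:
# 4 to 30 residues (convenient), 5 to 15 residues (preferred),
# plus a count of proline residues.

# Preferred linkers as listed in the specification (one-letter code).
PREFERRED_LINKERS = {
    "SEQ. ID NO. 2": "SSSASASSA",
    "SEQ. ID NO. 3": "GSPGSPG",
    "SEQ. ID NO. 4": "ATTTGSSPGPT",
}


def describe_linker(seq: str) -> dict:
    """Summarise length and proline content of a linker (hypothetical helper)."""
    n = len(seq)
    return {
        "length": n,
        "within_4_to_30": 4 <= n <= 30,
        "within_5_to_15": 5 <= n <= 15,
        "proline_count": seq.count("P"),
    }


for name, seq in PREFERRED_LINKERS.items():
    print(name, seq, describe_linker(seq))
```

All three preferred sequences fall inside the preferred 5 to 15 residue range; only two of them contain proline, which is consistent with proline being one of several ways of restricting flexibility rather than a requirement.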

It will be appreciated that although the invention has been
described primarily by reference to antigen binding proteins, it
is equally applicable to proteins comprising other binding units
as described above. References to antigen binding proteins will
accordingly be understood to refer also to such other proteins
unless the context dictates otherwise.

Multivalent antigen binding proteins according to the invention
may be prepared by transforming a host by incorporating a gene
encoding the polypeptide as set forth above and expressing said
gene in said host.

Suitably the host or hosts may be selected from prokaryotic
bacteria, such as Gram-negative bacteria, for example E. coli,
and Gram-positive bacteria, for example B. subtilis or lactic
acid bacteria, lower eukaryotes such as yeasts, for example
belonging to the genera Saccharomyces, Kluyveromyces, Hansenula
or Pichia, or moulds such as those belonging to the genera
Aspergillus or Trichoderma.

Preferred hosts for use in connection with the present invention
are the lower eukaryotic moulds and yeasts.

Techniques for synthesising genes, incorporating them into hosts
and expressing genes in hosts are well known in the art and the
skilled person would readily be able to put the invention into
effect using common general knowledge.

Methods for producing antibody fragments or functionalised
fragments thereof derived from the heavy chain immunoglobulin of
Camelidae using a transformed lower eukaryotic host are
described, for example, in patent application WO 94/25591 and
such techniques may suitably be applied to prepare constructs
according to the present invention.

Proteins according to the invention may be recovered and
purified using conventional techniques such as affinity
chromatography, ion exchange chromatography or gel filtration
chromatography.

The activity of the multivalent binding proteins according to
the invention may conveniently be measured by standard
techniques known in the art such as enzyme-linked
immunosorbent assay (ELISA), radioimmunoassay (RIA) or by
using biosensors.

The following examples are provided by way of illustration only.
Techniques used for the manipulation and analysis of nucleic
acid materials were performed as described in Sambrook et al,
Molecular Cloning, Cold Spring Harbor Press, New York, 2nd Ed.,
(1989) unless otherwise indicated.

Restriction sites are underlined.

HC-V denotes heavy chain variable domain.

Example 1 Self Assembling Llama Bi-heads Containing Linker
Peptides on Latex to Assay hCG

1.1 Construction of Llama Bi-heads with Various Linkers



a) Induction of humoral immune responses in llama

Male llamas were immunised with a water in oil emulsion (1:9
v/v, antigen in water:Specol (Bokhout et al, Vet. Immunol.
Immunopath., 2: 491-500 (1981))) subcutaneously and
intramuscularly. Per immunisation site 0.75-1.5 ml water in oil
emulsion was inoculated containing 100 µg antigen. The antigens
used were hCG (Sigma) and the azo-dye RR6 (ICI), which was
coupled to BSA via its reactive triazine group. Immunisations
were performed according to the following time table: the second
immunisation was performed three weeks after the first, and the
third was performed two weeks after the second immunisation.
The immune response was followed by antigen specific ELISAs.

The anti-RR6 response was measured by using Nunc Covalink
plates, which were coated with the azo-dye. After incubation
with (diluted) serum samples, the bound llama antibodies were
detected via an incubation with polyclonal rabbit-anti-llama
antiserum (obtained via immunising rabbits with llama
immunoglobulins which were purified via ProtA and ProtG columns;
ID-DLO), followed by an incubation with swine-anti-rabbit
immunoglobulins (Dako) conjugated with alkaline phosphatase.
Finally the alkaline phosphatase enzyme activity was determined
after incubation with p-nitro-phenyl phosphate and the optical
density was measured at 405nm. The anti-hCG response was
measured in essentially the same way using Nunc maxi-sorb plates
coated with hCG.

b) Cloning, expressing and screening of llama HC-V fragments

i) Isolation of gene fragments encoding llama HC-V domains

From an immunised llama a blood sample of about 200ml was taken
and an enriched lymphocyte population was obtained via Ficoll
(Pharmacia) discontinuous gradient centrifugation. From these
cells, total RNA was isolated by acid guanidinium thiocyanate
extraction (e.g. via the method described by Chomczynski and
Sacchi, Analytical Biochemistry, 162: 156-159 (1987)). After
first strand cDNA synthesis (e.g. with the Amersham first strand
cDNA kit), DNA fragments encoding HC-V fragments and part of the
long or short hinge region were amplified by PCR using specific
primers:

                 PstI
VH-2B   5'-AGGTSMARCTGCAGSAGTCWGG-3'  (SEQ. ID NO. 6)

S = C and G, M = A and C, R = A and G, W = A and T

                 HindIII
Lam-07  5'-AACAGTTAAGCTTCCGCTTGCGGCCGCGGAGCTGGGGTCTTCGCTGTGGTGCG-3'
        (short hinge) (SEQ. ID NO. 7)

                 HindIII
Lam-08  5'-AACAGTTAAGCTTCCGCTTGCGGCCGCTGGTTGTGGTTTTGGTGTCTTGGGTT-3'
        (long hinge) (SEQ. ID NO. 8)
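The degenerate positions in the VH-2B primer mean that it represents a pool of related sequences rather than a single oligonucleotide. As an illustration only, the sketch below expands the primer using the ambiguity codes listed above and confirms that every member of the pool retains the PstI recognition sequence CTGCAG. The names `IUPAC`, `expand` and `VH_2B` are introduced here for the example and do not appear in the specification.

```python
from itertools import product

# Ambiguity codes exactly as defined in the text for primer VH-2B.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "S": "CG", "M": "AC", "R": "AG", "W": "AT",
}

VH_2B = "AGGTSMARCTGCAGSAGTCWGG"  # SEQ. ID NO. 6, 5' to 3'
PSTI_SITE = "CTGCAG"              # PstI recognition sequence


def expand(primer: str) -> list:
    """Return every concrete sequence represented by a degenerate primer."""
    choices = [IUPAC[base] for base in primer]
    return ["".join(combo) for combo in product(*choices)]


variants = expand(VH_2B)
print(len(variants), "primer variants")
print("PstI site starts at index", VH_2B.find(PSTI_SITE))
print("all variants keep the PstI site:",
      all(PSTI_SITE in v for v in variants))
```

Because the PstI site lies entirely within non-degenerate positions, the degeneracy affects which template sequences are primed but not the subsequent PstI digestion.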

Upon digestion of the PCR fragments with PstI (coinciding with
codon 4 and 5 of the HC-V domain, encoding the amino acids L-Q)
and BstEII (located at the 3'-end of the HC-V gene fragments,
coinciding with the amino acid sequence Q-V-T), the DNA
fragments with a length between 300 and 400bp (encoding the HC-V
domain, but lacking the first three and the last three codons)
were purified via gel electrophoresis and isolation from the
agarose gel.

ii) Construction of Saccharomyces cerevisiae expression
plasmids encoding llama HC-V domains

Plasmids pUR4547 and pUR4548 are Saccharomyces cerevisiae
episomal expression plasmids, derived from pSY1 (Harmsen et al.,
Gene, 125: 115-123, (1993)). From pSY1 the PstI site, located in
front of the GAL7 promoter, was removed after partial digestion
with PstI, incubation with Klenow fragment and subsequent blunt
end ligation. After transformation the desired plasmid could be
selected on the basis of restriction pattern analysis.
Subsequently, the BstEII site in the Leu2 selection marker was
removed by replacing the about 410bp AflII/PflMI fragment with a
corresponding fragment in which the BstEII site was removed via
a three step PCR mutagenesis, using the primers:

PCR-A:

              PflMI
BOLI 1  5'-GGGAATTCCAATAGGTGGTTAGCAATCG  (SEQ. ID NO. 9)

              (BstEII)
BOLI 4  5'-GACCAACGTGGTCGCCTGGCAAAACG  (SEQ. ID NO. 10)

PCR-B:

              (BstEII)
BOLI 3  5'-CGTTTTGCCAGGCGACCACGTTGGTC  (SEQ. ID NO. 11)

              AflII
BOLI 2  5'-CCCCAAGCTTACATGGTCTTAAGTTGGCGT  (SEQ. ID NO. 12)

30 

PCR-A was performed with primers BOLI 1 and BOLI 4 and resulted
in an about 130bp fragment with the PflMI restriction site at
the 3'-end and the inactivated BstEII site at the 5'-end. PCR-B
was performed with primers BOLI 2 and BOLI 3 and resulted in an
about 290bp fragment with the AflII site at the 5'-end. The
third PCR was with the fragments obtained from reaction A and B,
together with the primers BOLI 1 and BOLI 2.
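The three step procedure relies on the products of PCR-A and PCR-B sharing an overlapping end, which is supplied by the two internal primers. As a hedged illustration, the sketch below checks that BOLI 3 and BOLI 4 are exact reverse complements of each other and that neither contains a BstEII recognition sequence (GGTNACC). The helper `reverse_complement` and the final single-base "restoration" are assumptions made for the example; the specification does not state the original sequence of the site.

```python
import re

BOLI_3 = "CGTTTTGCCAGGCGACCACGTTGGTC"  # SEQ. ID NO. 11, 5' to 3'
BOLI_4 = "GACCAACGTGGTCGCCTGGCAAAACG"  # SEQ. ID NO. 10, 5' to 3'

# BstEII recognises GGTNACC, where N is any base.
BSTEII = re.compile("GGT[ACGT]ACC")

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


# The internal primers anneal to each other, so the PCR-A and PCR-B
# products can prime one another in the third reaction.
print("BOLI 3 is the reverse complement of BOLI 4:",
      reverse_complement(BOLI_4) == BOLI_3)

# Neither primer carries an intact BstEII site.
print("BstEII site in BOLI 3:", bool(BSTEII.search(BOLI_3)))
print("BstEII site in BOLI 4:", bool(BSTEII.search(BOLI_4)))

# Assumption for illustration: changing GGCGACC back to GGTGACC would
# recreate a BstEII site, suggesting a single-base inactivation.
hypothetical_wild_type = BOLI_3.replace("GGCGACC", "GGTGACC")
print("BstEII site after hypothetical reversion:",
      bool(BSTEII.search(hypothetical_wild_type)))
```

The complementary 26-base overlap is what allows the outer primers BOLI 1 and BOLI 2 to amplify a single joined fragment in the third reaction.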
Finally, the about 1.8kb SacI-HindIII fragment was replaced with
synthetic fragments, having sequences as presented below,
resulting in plasmids pUR4547 and pUR4548, respectively.

- SacI/HindIII fragment of pUR4547 (SEQ. ID NO. 13-16)

SacI
GAGCTCATCACACAAACAAACAAAACAAAATGATGCTTTTGCAAGCCTTCCCTT
CTCGAGTAGTGTGTTTGTTTGTTTTGTTTTACTACGAAAACGTTCGGAAGGGAA
MMLLQAFLF
|-> SUC2 ss

PstI
TTCCTTTTGGCTGGTTTTGCAGCCAAAATATCTGCGCAGGTGCAGCTGCAGG
AAGGAAAACCGACCAAAACGTCGGTTTTATAGACGCGTCCACGTCGACGTCC
LLAGFAAKISAQVQLQE

BstEII HindIII
AGTCATAATGAGGGACCCAGGTCACCGTCTCCTCATAATGACTTAAGCTT
106 + + + + + 155
TCAGTATTACTCCCTGGGTCCAGTGGCAGAGGAGTATTACTGAATTCGAA
ES**GTQVTVSS**
HC-V cassette <-|
and

- SacI/HindIII fragment of pUR4548 (SEQ. ID NO. 17-20)

SacI
GAGCTCATCACACAAACAAACAAAACAAAATGATGCTTTTGCAAGCCTTCCTTT
CTCGAGTAGTGTGTTTGTTTGTTTTGTTTTACTACGAAAACGTTCGGAAGGAAA
MMLLQAFLF
|-> SUC2 ss

PstI
TCCTTTTGGCTGGTTTTGCAGCCAAAATATCTGCGCAGGTGCAGCTGCAGG
AGGAAAACCGACCAAAACGTCGGTTTTATAGACGCGTCCACGTCGACGTCC
LLAGFAAKISAQVQLQE

BstEII
AGTCATAATGAGGGACCCAGGTCACCGTCTCCTCAGAACAAAAACTCATC
TCAGTATTACTCCCTGGGTCCAGTGGCAGAGGAGTCTTGTTTTTGAGTAG
S**GTQVTVSSEQKLI
HC-V cassette <-| |-> myc tail

HindIII
TCAGAAGAGGATCTGAATTAATGACTTAAGCTT
156 + + + 188
AGTCTTCTCCTAGACTTAATTACTGAATTCGAA
SEEDLN**
<-|

Both plasmids contain the GAL7 promoter and PGK terminator
sequences as well as the invertase (SUC2) signal sequence. In
both plasmids the DNA sequence encoding the SUC2 signal sequence
is followed by the first 5 codons (encoding Q-V-Q-L-Q) of the
HC-V domain (including the PstI site), a stuffer sequence, and
the last six codons (encoding Q-V-T-V-S-S) of the HC-V domain. In
pUR4547, this is followed by two stop codons, an AflII and a
HindIII site. In pUR4548, this sequence is followed by eleven
codons encoding the myc-tag, two stop codons, an AflII and a
HindIII site.

Plasmids pUR4547 and pUR4548 were deposited under the Budapest
Treaty at the Centraal Bureau voor Schimmelcultures, Baarn on
18th August 1997 with deposition numbers: CBS 100012 and CBS
100013, respectively. In accordance with Rule 28(4) EPC, or a
similar arrangement from a state not being a contracting state
of the EPC, it is hereby requested that a sample of such
deposit, when requested, will be submitted to an expert only.

Upon digesting pUR4548 with PstI and BstEII, the about 6.4kb
vector fragment was isolated and ligated with the PstI-BstEII
fragments of about 350bp obtained as described above. After
transformation of S. cerevisiae, via electroporation,
transformants were selected from minimal medium agar plates
(comprising 0.7% yeast nitrogen base, 2% glucose and 2% agar,
supplemented with the essential amino acids and bases).

iii) Screening for antigen specific HC-V domains 

For the production of llama HC-V fragments with myc-tail,
individual transformants were grown overnight in selective
minimal medium (comprising 0.7% yeast nitrogen base, 2% glucose,
supplemented with the essential amino acids and bases) and
subsequently diluted ten times in YPGal medium (comprising 1%
yeast extract, 2% Bacto peptone and 5% galactose). After 24 and
48 hours of growth, the culture supernatant of the colonies was
analysed by ELISA for the presence of HC-V fragments which
specifically bind to the antigens hCG and RR6, in essentially
the same way as described above. In this case, however, the
presence of specifically bound HC-V fragments was detected by
incubation with monoclonal anti-myc antibodies, followed by
incubation with polyclonal rabbit-anti-mouse conjugate with
alkaline phosphatase. In this way a number of anti-hCG and
anti-RR6 HC-V fragments were isolated, which are:

anti-RR6:

R9 pUR4640 (see Figure 1) (SEQ. ID NO. 21-22)

anti-hCG (alpha unit):

H14 pUR4601 (see Figure 2) (SEQ. ID NO. 23-24)

c) Production of llama HC-V biheads by S. cerevisiae

i) Construction of episomal expression plasmids encoding
anti-hCG/anti-RR6 bispecific biheads
In the anti-hCG HC-V fragment H14 (anti-alpha-subunit), the PstI
site was removed and a XhoI site was introduced via PCR, using
the primers:

MPG158WB

                                 XhoI
5'-GAATTAAGCGGCCGCCCAGGTGAAACTGCTCGAGTCWGGGGGA-3' (SEQ. ID NO. 25)

and

MPG159WB

        BstEII
3'-CCCTGGGTCCAGTGGCAGAGGAGTGGCAGAGGAGTCTTGTTT-5' (SEQ. ID NO. 26)

In this way the sequence:

            PstI
CAG GTC CAG CTG CAG GAG TCT GGG (SEQ. ID NO. 27)
Q   V   Q   L   Q   E   S   G

became

                XhoI
CAG GTG AAA CTG CTC GAG TCW GGG (SEQ. ID NO. 28)
Q   V   K   L   L   E   S   G
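The effect of this primer-directed change on both the restriction sites and the encoded amino acids can be verified by translating the two sequences. The sketch below is illustrative only: it builds the standard genetic code table, translates SEQ. ID NO. 27 and both resolved forms of SEQ. ID NO. 28 (W = A or T), and checks for the PstI (CTGCAG) and XhoI (CTCGAG) recognition sequences. The names `CODON_TABLE` and `translate` are introduced for the example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate an unambiguous DNA sequence read in frame from its start."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


ORIGINAL = "CAGGTCCAGCTGCAGGAGTCTGGG"             # SEQ. ID NO. 27
MODIFIED_DEGENERATE = "CAGGTGAAACTGCTCGAGTCWGGG"  # SEQ. ID NO. 28
# Resolve the single ambiguous position (W = A or T).
modified_variants = [MODIFIED_DEGENERATE.replace("W", b) for b in "AT"]

print("original:", translate(ORIGINAL), "PstI present:", "CTGCAG" in ORIGINAL)
for variant in modified_variants:
    print("modified:", translate(variant),
          "PstI present:", "CTGCAG" in variant,
          "XhoI present:", "CTCGAG" in variant)
```

Both resolutions of the W position encode serine, so the degeneracy in the primer does not alter the resulting protein sequence.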



Upon digesting the PCR fragments with XhoI and BstEII, the about
330bp fragments were purified via agarose gel electrophoresis
and isolation from the gel. The fragments were cloned into
pUR4421 (see Example 1 in WO 94/25591) which was digested with
the same enzymes, resulting in pJS2 (H14). Subsequently, the
about 420bp EagI-HindIII fragment of pJS2 was isolated and
ligated in the about 6.6kb EagI-HindIII vector fragment of the
pSY1 plasmid of which the PstI and BstEII sites were removed as
described above. The resulting plasmid, pJS7, was digested with
BstEII and HindIII, after which the purified vector fragment was
religated in the presence of a synthetic linker having the
following sequence:
(SEQ. ID NO. 29-31)

BstEII               PstI                   HindIII
<- MPG 160 WB (49) ->
GGTCACCGTCTCCTCACAGGTGCAGCTGCAGGAGTCACTGTAATGACTTAAGCTT
+ + + + + 55
CCAGTGGCAGAGGAGTGTCCACGTCGACGTCCTCAGTGACATTACTGAATTCGAA
<- MPG 161 WB (48) ->
VTVSSQVQLQESL**LKL

resulting in plasmid pJS9. Finally, the plasmid was digested
with PstI and HindIII, after which the purified vector fragments
of about 7.0kb were ligated with the PstI-HindIII fragments of
about 350bp of pUR4638 and pUR4640, encoding an anti-RR6 HC-V
fragment denoted R9 followed by the myc-tail. The resulting
S. cerevisiae episomal expression plasmid pUR4619 encodes an
anti-hCG-anti-RR6 bispecific bihead preceded by the SUC2 signal
sequence and followed by the myc-tail.

pUR4619: SUC2 - H14 - R9 - myc

(see Figures 3-4/SEQ. ID NO. 32-33)

Upon digesting these plasmids with XhoI and partially with
BstEII, XhoI-BstEII fragments of about 0.7kb can be isolated and
subsequently cloned into the vector fragment of pUR4547
(digested with the same enzymes). In this way biheads can be
obtained without the myc tail.

It will be appreciated that expression vectors can be
constructed in which different promoter systems, e.g. the
constitutive GAPDH promoter, or different signal sequences, e.g.
the mating factor prepro sequence, are used.

ii) Production of the HC-V biheads 

After introducing the expression plasmid pUR4619 into
S. cerevisiae via electroporation, transformants were selected
from minimal medium agar plates as described in part b(ii)
above. For the production of biheads, the transformants were
grown overnight in selective minimal medium and subsequently
diluted ten times in YPGal medium. After 24 and 48 hours of
growth, samples were taken for Western blot analysis. For the
immunodetection of the produced biheads via Western blot
analysis, monoclonal anti-myc antibodies were used, followed by
incubation with polyclonal rabbit-anti-mouse conjugate with
alkaline phosphatase.

d) Anti-hCG/anti-RR6 bispecific biheads containing a linker
peptide

i) Construction of S. cerevisiae episomal expression plasmids
encoding anti-hCG/anti-RR6 bispecific biheads containing a
linker peptide

Between the H14 and the R9 encoding DNA fragments synthetic
linkers were introduced encoding different linker peptides. To
this end the about 50 bp long BstEII-HindIII fragment of pJS7
(see Example 1 c(i) above) was replaced by an about 50 bp long
BstEII-HindIII fragment having the following sequence:

MVaJA

BstEII   XbaI   DraIII   PstI   HindIII

5' GTCACCGTCTCTAGATGGCC^ 3' 

(SEQ. ID NO. 34)

MVbJA

3' GCAGAGATCTACCGGTGGTCCACGTCGAGCTCCTCAGTTGAATTCGA 5'

(SEQ. ID NO. 35)

This resulted in pSJ7a. In this plasmid the about 20 bp
PstI-HindIII fragment was replaced with the about 370 bp
PstI-HindIII fragment of pUR4640 encoding the anti-RR6 HC-V
fragment R9 with the myc-tail (see Example 1 c(i)), resulting
in pSJ7b.

Upon digesting plasmid pSJ7b with XbaI and DraIII the about 7 kb
vector fragment was ligated with five synthetic oligonucleotide
linker fragments presented below:

MV01JA 5' CTAGTGGTACTTCCGGTTCCCAG 3' (SEQ. ID NO. 36)

MV02JA 3' ACCATGAAGGCCAAGG 5' (SEQ. ID NO. 37)

SGTSGSQ

MV03JA 5' CTAGTTCTTCATCTGCTTCTGCCTCTTCAGCCCAG 3'
(SEQ. ID NO. 38)

MV04JA 3' AAGAAGTAGACGAAGACGGAGAAGTCGG 5' (SEQ. ID NO. 39)

SSSSASASSAQ

MV05JA 5' CTAGTGGTTCTCCAGGTTCACCAGGTCAG 3' (SEQ. ID NO. 40)

MV06JA 3' ACCAAGAGGTCCAAGTGGTCCA 5' (SEQ. ID NO. 41)

SGSPGSPGQ

MV07JA 5' CTAGTGCTACTACAACTGGTTCTTCACCAGGTCCAACTCAG 3'
(SEQ. ID NO. 42)

MV08JA 3' ACGATGATGTTGACCAAGAAGTGGTCCAGGTTGA 5'
(SEQ. ID NO. 43)

SATTTGSSPGPTQ

MV09JA 5' CTAGTGCTAATCATTCTGGTAATGCTTCTCAG 3'
(SEQ. ID NO. 44)

MV10JA 3' ACGATTAGTAAGACCATTACGAAGA 5' (SEQ. ID NO. 45)

SANHSGNASQ
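As a consistency check on the five oligonucleotide pairs listed above, the following sketch anneals each bottom strand (printed 3' to 5') against its top strand and reports the single-stranded ends that remain. The function name `overhangs` is illustrative only. Reading the resulting ends as XbaI-compatible and DraIII-compatible is an assumption about how the fragments were designed to ligate into the digested vector; the text does not state it.

```python
# Oligonucleotide pairs as printed above: top strand 5'->3', bottom strand 3'->5'.
PAIRS = {
    "MV01JA/MV02JA": ("CTAGTGGTACTTCCGGTTCCCAG",
                      "ACCATGAAGGCCAAGG"),
    "MV03JA/MV04JA": ("CTAGTTCTTCATCTGCTTCTGCCTCTTCAGCCCAG",
                      "AAGAAGTAGACGAAGACGGAGAAGTCGG"),
    "MV05JA/MV06JA": ("CTAGTGGTTCTCCAGGTTCACCAGGTCAG",
                      "ACCAAGAGGTCCAAGTGGTCCA"),
    "MV07JA/MV08JA": ("CTAGTGCTACTACAACTGGTTCTTCACCAGGTCCAACTCAG",
                      "ACGATGATGTTGACCAAGAAGTGGTCCAGGTTGA"),
    "MV09JA/MV10JA": ("CTAGTGCTAATCATTCTGGTAATGCTTCTCAG",
                      "ACGATTAGTAAGACCATTACGAAGA"),
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def overhangs(top, bottom_3to5):
    """Return the (5' end, 3' end) of the top strand left unpaired after annealing.

    The bottom strand is written 3'->5', so its base-by-base complement
    (without reversal) is the stretch of the top strand that it covers.
    """
    paired = bottom_3to5.translate(COMPLEMENT)
    start = top.find(paired)
    if start < 0:
        raise ValueError("strands are not complementary")
    return top[:start], top[start + len(paired):]


for name, (top, bottom) in PAIRS.items():
    five_prime, three_prime = overhangs(top, bottom)
    print(f"{name}: 5' overhang {five_prime}, 3' overhang {three_prime}")
```

Every pair leaves a four-base 5' end (CTAG) and a three-base 3' end (CAG) on the top strand, so the five inserts differ only in the duplex region that encodes the linker.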

The oligonucleotide linker fragments encode the last amino acid
of the N-terminal HC-V fragment (S) and the first amino acid of
the C-terminal HC-V fragment (Q), with the connecting linker
peptide between them. This resulted in plasmids pUR5330 to
pUR5334, respectively.
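The relationship between the top-strand oligonucleotides and the peptide sequences printed beneath them can be checked with a short translation sketch using the standard genetic code. The reading-frame offset of 2 is an assumption inferred from the pSJ7a insert (GTC ACC GTC TCT AGA ...): the leading "CT" of each oligonucleotide is taken to complete the preceding serine codon, so translation starts at the third base. The helper name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Top strands (5'->3') as given in the text.
TOP_STRANDS = {
    "MV01JA": "CTAGTGGTACTTCCGGTTCCCAG",
    "MV03JA": "CTAGTTCTTCATCTGCTTCTGCCTCTTCAGCCCAG",
    "MV05JA": "CTAGTGGTTCTCCAGGTTCACCAGGTCAG",
    "MV07JA": "CTAGTGCTACTACAACTGGTTCTTCACCAGGTCCAACTCAG",
    "MV09JA": "CTAGTGCTAATCATTCTGGTAATGCTTCTCAG",
}


def translate(seq, offset=2):
    """Translate seq from the given offset, ignoring any trailing partial codon."""
    body = seq[offset:]
    usable = len(body) - len(body) % 3
    return "".join(CODONS[body[i:i + 3]] for i in range(0, usable, 3))


for name, seq in TOP_STRANDS.items():
    print(name, translate(seq))
```

Under that frame assumption each translation begins with S and ends with Q, matching the one-letter sequences listed with the oligonucleotides.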

After transformation of S. cerevisiae with these plasmids, the
production levels of the biheads were determined via Western
blot analysis and an anti-hCG ELISA using anti-myc mAb for
detection of the bound bihead (see Example 1 b(iii)). Production
levels are presented in Table 2 below:

Table 2

Plasmid    Linker                       Production level (mg/l)

pUR4619    None                         11
pUR5330    S-G-T-S-G-S-Q                36
pUR5331    S-S-S-S-A-S-A-S-S-A-Q        49
pUR5332    S-G-S-P-G-S-P-G-Q            33
pUR5333    S-A-T-T-T-G-S-S-P-G-P-T-Q    56
pUR5334    S-A-N-H-S-G-N-A-S-Q          51



The production levels of the biheads in which the two HC-V
domains are separated by a linker peptide (consisting of between
5 and 11 amino acids) were found to be 3 to 5 times higher than
that found for the bihead in which the two HC-V fragments are
connected without a peptide linker.
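The stated 3- to 5-fold range can be reproduced from the values in Table 2 with a few lines of arithmetic. The sketch below reads the pUR5330 and pUR5332 entries as 36 and 33 mg/l and uses pUR4619 (no linker) as the baseline; the variable names are illustrative.

```python
# Production levels (mg/l) taken from Table 2.
PRODUCTION_MG_PER_L = {
    "pUR4619": 11,  # no linker (baseline)
    "pUR5330": 36,
    "pUR5331": 49,
    "pUR5332": 33,
    "pUR5333": 56,
    "pUR5334": 51,
}

baseline = PRODUCTION_MG_PER_L["pUR4619"]

# Fold increase of each linker-containing bihead over the linker-free bihead.
fold_increase = {
    plasmid: level / baseline
    for plasmid, level in PRODUCTION_MG_PER_L.items()
    if plasmid != "pUR4619"
}

for plasmid, fold in fold_increase.items():
    print(f"{plasmid}: {fold:.1f}-fold")
```

The ratios fall between 3.0 (pUR5332) and about 5.1 (pUR5333), consistent with the range quoted in the text.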
Finally, the bispecificity of the biheads was demonstrated as
follows:

PINs coated with hCG were incubated with (diluted) medium
samples. Subsequently, the PINs were incubated with an
RR6-alkaline phosphatase conjugate, in which the azo-dye RR6 was
coupled to the alkaline phosphatase via its reactive triazine
group. Finally, the alkaline phosphatase enzyme activity was
determined after incubation of the PINs with p-nitrophenyl
phosphate and the optical density was measured at 405 nm (see
Figure 5).

1.2 Purification of Llama Bi-heads with Various Linkers from
S. cerevisiae Culture Media

A 5 ml column of recombinant Protein A Fast Flow Sepharose
(Amersham Pharmacia Biotech) was equilibrated by washing with 10
column volumes of wash buffer (10 mM potassium phosphate, pH 6)
at a flow rate of 2 ml/min. The bi-head fermentation broth was
loaded at 2 ml/min in an upwards direction. After loading, the
column was washed with wash buffer until the OD280 reached the
baseline. Elution was carried out with a linear gradient of 0 -
40 mM citric acid pH 2.5 in the reverse direction, collecting 4
ml fractions into tubes containing 400 µl of a neutralising
agent (1 M Tris.Cl, pH 8.5) in order to minimise the effects of
the acid. Peak fractions were checked for purity by running on
a 12% SDS-PAGE Ready Gel (Bio-Rad) under standard denaturing
conditions. Staining was with GelCode Blue (Pierce & Warriner).
The fractions were concentrated using Macrosep centrifugal
concentrators (3 kDa molecular weight cut-off, Pall Filtron
Corp.), then buffer exchanged into 10 mM potassium phosphate, pH 6
using PD-10 columns (Amersham Pharmacia Biotech). The final
purity of the sample was determined by carrying out a UV scan
from 400 - 220 nm and using the value at 280 nm to determine an
accurate concentration. The samples were then aliquoted into
vials, frozen, freeze dried and stored until required.
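Two quantities implied by this protocol can be worked out directly from the stated figures: the time needed for the 10-column-volume equilibration, and the approximate Tris concentration in each neutralised fraction. The sketch assumes each 4 ml fraction is added to the 400 µl of neutralising agent already in the tube (4.4 ml total); that reading, and the variable names, are assumptions rather than statements from the text.

```python
# Equilibration: 10 column volumes through a 5 ml column at 2 ml/min.
COLUMN_VOLUME_ML = 5.0
FLOW_RATE_ML_PER_MIN = 2.0
EQUILIBRATION_COLUMN_VOLUMES = 10

equilibration_volume_ml = EQUILIBRATION_COLUMN_VOLUMES * COLUMN_VOLUME_ML
equilibration_time_min = equilibration_volume_ml / FLOW_RATE_ML_PER_MIN

# Neutralisation: 4 ml eluate collected into 400 µl of 1 M Tris.Cl, pH 8.5.
FRACTION_ML = 4.0
TRIS_ML = 0.4
TRIS_STOCK_M = 1.0

# Assumption: final volume is fraction plus neutralising agent.
tris_final_m = TRIS_STOCK_M * TRIS_ML / (FRACTION_ML + TRIS_ML)

print(f"Equilibration: {equilibration_volume_ml:.0f} ml over "
      f"{equilibration_time_min:.0f} min")
print(f"Tris in neutralised fraction: {tris_final_m * 1000:.0f} mM")
```

On these assumptions the Tris ends up at roughly 90 mM, in excess of the at most 40 mM citric acid delivered by the gradient.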

1.3 Preparation of a Reactive Red 6-Bovine Serum Albumin
Conjugate

A solution of Reactive Red 6 (RR6) was made up at 10 mg/ml in
phosphate buffered saline (PBS). A solution of bovine serum
albumin (BSA) was made up at 10 mg/ml in PBS. 200 µl of the RR6
solution was added to 800 µl of the BSA solution and the
resulting solution was mixed in an end-over-end rotary mixer for
2 hours at room temperature. RR6 that had conjugated to BSA was
separated from free RR6 by addition of the reaction mixture (1
ml) to a PD10 column (Pharmacia) previously washed with 10 ml of
PBS containing 0.1% sodium azide (PBSA). The column was then
eluted by addition of PBSA (5 ml) and 1 ml aliquots were
collected. The RR6-BSA conjugate eluted in fractions 4 and 5.
These were pooled, the concentration of protein was determined
using a BCA protein test, and the concentration was adjusted to
2 mg/ml with PBSA.
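The quantities in the conjugation mixture, and the final adjustment to 2 mg/ml, follow from simple dilution arithmetic. The sketch below uses only the volumes and stock concentrations stated above; the helper `final_volume_ml` and the example pool values passed to it are hypothetical, since the text does not report the measured BCA concentration.

```python
import math

STOCK_MG_PER_ML = 10.0  # both the RR6 and the BSA stock, as stated
RR6_VOLUME_ML = 0.2     # 200 µl
BSA_VOLUME_ML = 0.8     # 800 µl

rr6_mg = STOCK_MG_PER_ML * RR6_VOLUME_ML
bsa_mg = STOCK_MG_PER_ML * BSA_VOLUME_ML
total_ml = RR6_VOLUME_ML + BSA_VOLUME_ML
rr6_to_bsa_mass_ratio = rr6_mg / bsa_mg


def final_volume_ml(pool_volume_ml, measured_mg_per_ml, target_mg_per_ml=2.0):
    """Volume to which a pool must be made up with PBSA to reach the target."""
    if measured_mg_per_ml < target_mg_per_ml:
        raise ValueError("pool is already below the target concentration")
    return pool_volume_ml * measured_mg_per_ml / target_mg_per_ml


print(f"RR6: {rr6_mg:.1f} mg, BSA: {bsa_mg:.1f} mg in {total_ml:.1f} ml")
print(f"RR6:BSA mass ratio = {rr6_to_bsa_mass_ratio:.2f}")

# Hypothetical example: a 2 ml pool measured at 3 mg/ml.
print(f"Make up to {final_volume_ml(2.0, 3.0):.1f} ml")
```

The reaction therefore contains 2 mg of RR6 and 8 mg of BSA in 1 ml, a 1:4 mass ratio.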

1.4 Adsorption of Latex with Reactive Red 6-Bovine Serum
Albumin Conjugate

Duke blue latex was adsorbed with the RR6-BSA conjugate as
follows:

To 950 µl of 10 mM borate buffer, 0.01% merthiolate, pH 8.5
(buffer B) a 50 µl aliquot of Duke blue latex (10% solids) was
added and mixed by inverting. The diluted latex was centrifuged
at 8000 g for 10 minutes at room temperature, the supernatant
removed and the pellet vortexed briefly. The pellet was
resuspended in 900 µl of buffer B and to this 100 µl of the
previously prepared RR6-BSA conjugate was added. The latex
solution was sonicated for 10 s using a sonic probe. The
solution containing the latex was mixed for 2 h at room
temperature and then centrifuged (8000 g, 10 min at room
temperature). The latex pellet was washed by resuspending in 1
ml of buffer B and centrifuged once more (8000 g, 10 min at room
temperature). The pellet was then resuspended in 1 ml buffer B
ready for use.

1.5 Analysis of Llama Bi-head Self Assembling on Reactive Red
6-Bovine Serum Albumin Adsorbed Latex

The llama bi-heads were tested by self-assembling onto RR6-BSA
adsorbed latex and detection of hCG in a rapid assay technology
(RAT) format. This was performed by mixing the llama bi-head (5
µl of a 0.1 mg/ml solution) with RR6-BSA adsorbed latex (5 µl of
0.1% solids) in 10 µl of PBSA to which hCG (5 µl of various
concentrations) was added. The resulting solution was added to
the bottom of a nitrocellulose strip (6 mm wide x 30 mm long) on
which a monoclonal antibody recognising hCG had been adsorbed by
plotting in a line (2.5 mg/ml) midway up the strip. The
latex-bi-head-hCG solution was allowed to flow up the
nitrocellulose strip by capillary action and the strip was then
washed by applying PBSA (25 µl) to the bottom of the strip. The
amount of latex captured at the plotted antibody line on the
nitrocellulose strip was quantified by measuring the absorbance
through the strip.
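The composition of the mixture applied to each strip follows from the stated volumes. The sketch below computes the total volume, the amount and final concentration of bi-head, the final latex solids and the dilution applied to each hCG standard; it assumes the four components are simply additive in volume, and the variable names are illustrative.

```python
import math

# Component volumes in µl, as stated in the text.
VOLUMES_UL = {"bi-head": 5.0, "latex": 5.0, "PBSA": 10.0, "hCG": 5.0}

BIHEAD_STOCK_MG_PER_ML = 0.1
LATEX_STOCK_PERCENT_SOLIDS = 0.1

total_ul = sum(VOLUMES_UL.values())

# µl x mg/ml gives µg directly.
bihead_ug = VOLUMES_UL["bi-head"] * BIHEAD_STOCK_MG_PER_ML
bihead_final_mg_per_ml = bihead_ug / total_ul
latex_final_percent = LATEX_STOCK_PERCENT_SOLIDS * VOLUMES_UL["latex"] / total_ul
hcg_dilution_factor = total_ul / VOLUMES_UL["hCG"]

print(f"Total volume: {total_ul:.0f} µl")
print(f"Bi-head: {bihead_ug:.1f} µg ({bihead_final_mg_per_ml:.2f} mg/ml final)")
print(f"Latex: {latex_final_percent:.2f}% solids final")
print(f"hCG standards are diluted {hcg_dilution_factor:.0f}-fold in the mix")
```

Each strip therefore receives 0.5 µg of bi-head in 25 µl, and any hCG concentration quoted for the added standard is five-fold lower in the applied mixture.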

Figure 6 shows that the llama bi-heads with linkers 3, 4 and 5
gave the highest response in RAT assays. These linkers are
structurally more ordered than the comparative examples
(flexible linkers 2 and 6) and result in more hCG and more latex
being captured in the assay. The more ordered linkers promote
the correct orientation of the binding domains, achieving better
binding than when no linker is used. Linker 3 is derived from
CWP1 and Linker 5 from CBH1P.

Synthetic linkers with some order (linker 4, containing 2 proline
residues) can offer greater sensitivity in assays than those
with little order (linker 2). Figure 7 shows that the bi-head
with linker 4 can detect lower amounts (50 mIU/ml) of hCG than
the bi-head with linker 2 and, hence, gives a more sensitive
assay for hCG.
CLAIMS



1. Use of a polypeptide group, the amino acid sequence of 
which group confers restricted conformational flexibility, 
as a linking group to link binding units in a multivalent 
binding protein. 

2. Use according to claim 1 wherein the polypeptide linking 
group comprises from 4 to 30 amino acid residues. 

3. Use according to claim 1 or 2 wherein the linking group 
comprises one or more proline residues. 

4. Use according to claim 1 or 2 wherein the linking group
comprises an amino acid sequence selected from:

S-S-S-A-S-A-S-S-A,
G-S-P-G-S-P-G, or
A-T-T-T-G-S-S-P-G-P-T.



5. A multivalent binding protein comprising a plurality of 
binding units linked by means of intervening polypeptide 
linker groups, the amino acid sequence of which linker 
group confers restricted conformational flexibility. 

6. A protein according to claim 5 wherein the binding units 
comprise heavy chain variable domains derived from an 
immunoglobulin naturally devoid of light chains. 

7. A protein according to claim 5 or claim 6 wherein the
antigen binding units comprise heavy chain variable domains
derived from a Camelid immunoglobulin.

8. A protein according to any one of claims 5 to 7 comprising
a bivalent antigen binding protein.

9. A protein according to any one of claims 5 to 8 wherein the
linker group comprises from 4 to 30 amino acid residues.

10. A protein according to any one of claims 5 to 9 wherein the
linker group comprises one or more proline residues.

11. A protein according to any one of claims 5 to 9 wherein the 
linker group comprises an amino acid sequence selected 
from: 



S-S-S-A-S-A-S-S-A, 
G-S-P-G-S-P-G, or 
A-T-T-T-G-S-S-P-G-P-T.



12. Nucleotide sequences encoding for a multivalent binding 
protein of any one of claims 5 to 11. 

13. An expression vector comprising a nucleotide sequence 
according to claim 12. 

14. A host cell transformed with a vector according to claim 
(54) Title: ANTIGEN-BINDING PROTEINS
(57) Abstract

Use of a polypeptide linker group, the amino acid sequence of which group confers restricted conformational flexibility, as a linking
group to link binding units, preferably antigen binding units comprising heavy chain variable domains derived from an immunoglobulin
naturally devoid of light chains, in a multivalent binding protein.
SEQUENCE LISTING

<110> UNILEVER PLC
      UNILEVER N.V.

<120> ANTIGEN BINDING PROTEINS

<130> T3077

<140>
<141>

<160> 45

<170> PatentIn Ver. 2.1

<210> 1
<211> 5
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 1
Gly Thr Ser Gly Ser
1 5

<210> 2
<211> 9
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 2
Ser Ser Ser Ala Ser Ala Ser Ser Ala
1 5

<210> 3
<211> 7
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 3
Gly Ser Pro Gly Ser Pro Gly
1 5

<210> 4
<211> 11
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 4
Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr
1 5 10

<210> 5
<211> 8
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 5
Ala Asn His Ser Gly Asn Ala Ser
1 5



<210> 6
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 6
aggtsmarct gcagsagtcw gg 22

<210> 7
<211> 53
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 7
aacagttaag cttccgcttg cggccgcgga gctggggtct tcgctgtggt gcg 53

<210> 8
<211> 53
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 8
aacagttaag cttccgcttg cggccgctgg ttgtggtttt ggtgtcttgg gtt 53

<210> 9
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 9
gggaattcca ataggtggtt agcaatcg 28

<210> 10
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 10
gaccaacgtg gtcgcctggc aaaacg 26

<210> 11
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 11
cgttttgcca ggcgaccacg ttggtc 26

<210> 12
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 12
ccccaagctt acatggtctt aagttggcgt 30



<210> 13
<211> 155
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<220>
<221> CDS
<222> (3)..(110)

<220>
<221> CDS
<222> (117)..(140)

<220>
<221> CDS
<222> (147)..(155)

<400> 13
ga gct cat cac aca aac aaa caa aac aaa atg atg ctt ttg caa gcc 47
   Ala His His Thr Asn Lys Gln Asn Lys Met Met Leu Leu Gln Ala
   1 5 10 15

ttc ctt ttc ctt ttg gct ggt ttt gca gcc aaa ata tct gcg cag gtg 95
Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys Ile Ser Ala Gln Val
20 25 30

cag ctg cag gag tca taatga ggg acc cag gtc acc gtc tcc tca taatga 146
Gln Leu Gln Glu Ser        Gly Thr Gln Val Thr Val Ser Ser
35 40

ctt aag ctt 155
Leu Lys Leu
45

<210> 14
<211> 36
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<400> 14
Ala His His Thr Asn Lys Gln Asn Lys Met Met Leu Leu Gln Ala Phe
1 5 10 15

Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys Ile Ser Ala Gln Val Gln
20 25 30

Leu Gln Glu Ser
35

<210> 15
<211> 8
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<400> 15
Gly Thr Gln Val Thr Val Ser Ser
1 5

<210> 16
<211> 3
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<400> 16
Leu Lys Leu
1



<210> 17
<211> 188
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<220>
<221> CDS
<222> (3)..(110)

<220>
<221> CDS
<222> (117)..(173)

<220>
<221> CDS
<222> (180)..(188)

<400> 17
ga gct cat cac aca aac aaa caa aac aaa atg atg ctt ttg caa gcc 47
   Ala His His Thr Asn Lys Gln Asn Lys Met Met Leu Leu Gln Ala
   1 5 10 15

ttc ctt ttc ctt ttg gct ggt ttt gca gcc aaa ata tct gcg cag gtg 95
Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys Ile Ser Ala Gln Val
20 25 30

cag ctg cag gag tca taatga ggg acc cag gtc acc gtc tcc tca gaa 143
Gln Leu Gln Glu Ser        Gly Thr Gln Val Thr Val Ser Ser Glu
35 40 45

caa aaa ctc atc tca gaa gag gat ctg aat taatga ctt aag ctt 188
Gln Lys Leu Ile Ser Glu Glu Asp Leu Xaa        Xaa Lys Leu
50 55

<210> 18
<211> 36
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<400> 18
Ala His His Thr Asn Lys Gln Asn Lys Met Met Leu Leu Gln Ala Phe
1 5 10 15

Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys Ile Ser Ala Gln Val Gln
20 25 30

Leu Gln Glu Ser
35

<210> 19
<211> 19
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<400> 19
Gly Thr Gln Val Thr Val Ser Ser Glu Gln Lys Leu Ile Ser Glu Glu
1 5 10 15

Asp Leu Xaa

<210> 20
<211> 3
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<400> 20
Xaa Lys Leu
1

<210> 21
<211> 342
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<220>
<221> CDS
<222> (1)..(342)

<400> 21
cag gtg cag ctg cag gag tca ggg gga ggc ttg gtg cag gct ggg gag 48
Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Glu
1 5 10 15

tct ctg aaa ctc tcc tgt gca gcc tct gga aac acc ttc agt ggc ggc 96
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly Gly
20 25 30

ttc atg ggc tgg tac cgc cag gct cca ggg aag cag cgc gag ttg gtc 144
Phe Met Gly Trp Tyr Arg Gln Ala Pro Gly Lys Gln Arg Glu Leu Val
35 40 45

gca acc att aat agt aga ggt atc aca aac tat gca gac ttc gtg aag 192
Ala Thr Ile Asn Ser Arg Gly Ile Thr Asn Tyr Ala Asp Phe Val Lys
50 55 60

ggc cga ttc acc atc tcc aga gac aat gcc aag aag aca gtg tat ttg 240
Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu
65 70 75 80

gaa atg aac agc ctg gaa cct gaa gac acg gcc gtt tat tac tgt tac 288
Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys Tyr
85 90 95

act cac tac ttc aga tcc tac tgg ggt cag ggg acc cag gtc acc gtc 336
Thr His Tyr Phe Arg Ser Tyr Trp Gly Gln Gly Thr Gln Val Thr Val
100 105 110

tcc tca 342
Ser Ser

<210> 22
<211> 114
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<400> 22
Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Glu
1 5 10 15

Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly Gly
20 25 30

Phe Met Gly Trp Tyr Arg Gln Ala Pro Gly Lys Gln Arg Glu Leu Val
35 40 45

Ala Thr Ile Asn Ser Arg Gly Ile Thr Asn Tyr Ala Asp Phe Val Lys
50 55 60

Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu
65 70 75 80

Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys Tyr
85 90 95

Thr His Tyr Phe Arg Ser Tyr Trp Gly Gln Gly Thr Gln Val Thr Val
100 105 110

Ser Ser
<210> 23
<211> 351
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<220>
<221> CDS
<222> (1)..(351)

<400> 23
cag gtg cag ctg cag gag tca ggg gga gga ttg gtg cag gcg ggg ggc 48
Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Gly
1 5 10 15

tct ctg aga ctc tcc tgt gca gcc tct gga cgc acc ggc agt acg tat 96
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr
20 25 30

gac atg ggc tgg ttc cgc cag gct cca ggg aag gag cgt gag tct gta 144
Asp Met Gly Trp Phe Arg Gln Ala Pro Gly Lys Glu Arg Glu Ser Val
35 40 45

gca gct att aac tgg gat agt gcg cgc aca tac tat gca agc tcc gtg 192
Ala Ala Ile Asn Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val
50 55 60

agg ggc cga ttc acc atc tcc aga gac aac gcc aag aag acg gtg tat 240
Arg Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr
65 70 75 80

ctg caa atg aac agc ctg aaa cct gag gac acg gcc gtt tat acc tgt 288
Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys
85 90 95

ggc gcg ggg gaa ggt ggt act tgg gac tcc tgg ggc cag ggg acc cag 336
Gly Ala Gly Glu Gly Gly Thr Trp Asp Ser Trp Gly Gln Gly Thr Gln
100 105 110

gtc acc gtc tcc tca 351
Val Thr Val Ser Ser
115

<210> 24
<211> 117
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<400> 24
Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Gly
1 5 10 15

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr
20 25 30

Asp Met Gly Trp Phe Arg Gln Ala Pro Gly Lys Glu Arg Glu Ser Val
35 40 45

Ala Ala Ile Asn Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val
50 55 60

Arg Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr
65 70 75 80

Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys
85 90 95

Gly Ala Gly Glu Gly Gly Thr Trp Asp Ser Trp Gly Gln Gly Thr Gln
100 105 110

Val Thr Val Ser Ser
115



<210> 25
<211> 43
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 25
gaattaagcg gccgcccagg tgaaactgct cgagtcwggg gga 43

<210> 26
<211> 42
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 26
ccctgggtcc agtggcagag gagtggcaga ggagtcttgt tt 42

<210> 27
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 27
caggtccagc tgcaggagtc tggg 24

<210> 28
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PRIMER

<400> 28
caggtgaaac tgctcgagtc wggg 24

15 

<210> 29
<211> 55
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER; DOUBLE STRANDED

<220>
<221> CDS
<222> (2)..(40)

<220>
<221> CDS
<222> (47)..(55)

<400> 29
g gtc acc gtc tcc tca cag gtg cag ctg cag gag tca ctg taatga ctt 49
  Val Thr Val Ser Ser Gln Val Gln Leu Gln Glu Ser Xaa        Xaa
  1 5 10

aag ctt 55
Lys Leu
15

<210> 30
<211> 13
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: LINKER; DOUBLE STRANDED

<400> 30
Val Thr Val Ser Ser Gln Val Gln Leu Gln Glu Ser Xaa
1 5 10

<210> 31
<211> 3
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: LINKER; DOUBLE STRANDED

<400> 31
Xaa Lys Leu
1



<210> 32
<211> 672
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<220>
<221> CDS
<222> (1)..(672)

<400> 32
ctc gag tca ggg gga gga ttg gtg cag gcg ggg ggc tct ctg aga ctc 48
Leu Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Gly Ser Leu Arg Leu
1 5 10 15

tcc tgt gca gcc tct gga cgc acc ggc agt acg tat gac atg ggc tgg 96
Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr Asp Met Gly Trp
20 25 30

ttc cgc cag gct cca ggg aag gag cgt gag tct gta gca gct att aac 144
Phe Arg Gln Ala Pro Gly Lys Glu Arg Glu Ser Val Ala Ala Ile Asn
35 40 45

tgg gat agt gcg cgc aca tac tat gca agc tcc gtg agg ggc cga ttc 192
Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val Arg Gly Arg Phe
50 55 60

acc atc tcc aga gac aac gcc aag aag acg gtg tat ctg caa atg aac 240
Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu Gln Met Asn
65 70 75 80

agc ctg aaa cct gag gac acg gcc gtt tat acc tgt ggc gcg ggg gaa 288
Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys Gly Ala Gly Glu
85 90 95

ggt ggt act tgg gac tcc tgg ggc cag ggg acc cag gtc acc gtc tcc 336
Gly Gly Thr Trp Asp Ser Trp Gly Gln Gly Thr Gln Val Thr Val Ser
100 105 110

tca cag gtg cag ctg cag gag tca ggg gga ggc ttg gtg cag gct ggg 384
Ser Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly
115 120 125

gag tct ctg aaa ctc tcc tgt gca gcc tct gga aac acc ttc agt ggc 432
Glu Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly
130 135 140

ggc ttc atg ggc tgg tac cgc cag gct cca ggg aag cag cgc gag ttg 480
Gly Phe Met Gly Trp Tyr Arg Gln Ala Pro Gly Lys Gln Arg Glu Leu
145 150 155 160

gtc gca acc att aat agt aga ggt atc aca aac tat gca gac ttc gtg 528
Val Ala Thr Ile Asn Ser Arg Gly Ile Thr Asn Tyr Ala Asp Phe Val
165 170 175

aag ggc cga ttc acc atc tcc aga gac aat gcc aag aag aca gtg tat 576
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr
180 185 190

ttg gaa atg aac agc ctg gaa cct gaa gac acg gcc gtt tat tac tgt 624
Leu Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys
195 200 205

tac act cac tac ttc aga tcc tac tgg ggt cag ggg acc cag gtc acc 672
Tyr Thr His Tyr Phe Arg Ser Tyr Trp Gly Gln Gly Thr Gln Val Thr
210 215 220

<210> 33
<211> 224
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: PLASMID CONSTRUCT

<400> 33
Leu Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Gly Ser Leu Arg Leu
1 5 10 15

Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr Asp Met Gly Trp
20 25 30

Phe Arg Gln Ala Pro Gly Lys Glu Arg Glu Ser Val Ala Ala Ile Asn
35 40 45

Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val Arg Gly Arg Phe
50 55 60

Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu Gln Met Asn
65 70 75 80

Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys Gly Ala Gly Glu
85 90 95

Gly Gly Thr Trp Asp Ser Trp Gly Gln Gly Thr Gln Val Thr Val Ser
100 105 110

Ser Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly
115 120 125

Glu Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly
130 135 140

Gly Phe Met Gly Trp Tyr Arg Gln Ala Pro Gly Lys Gln Arg Glu Leu
145 150 155 160

Val Ala Thr Ile Asn Ser Arg Gly Ile Thr Asn Tyr Ala Asp Phe Val
165 170 175

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr
180 185 190

Leu Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys
195 200 205

Tyr Thr His Tyr Phe Arg Ser Tyr Trp Gly Gln Gly Thr Gln Val Thr
210 215 220

<210> 34
<211> 48
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: SYNTHETIC INSERT

<400> 34
gtcaccgtct ctagatggcc accaggtgca gctgcaggag tcaactta 48

<210> 35
<211> 47
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: SYNTHETIC INSERT

<400> 35
gcagagatct accggtggtc cacgtcgagc tcctcagttg aattcga 47

<210> 36
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 36
ctagtggtac ttccggttcc cag 23

<210> 37
<211> 16
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 37
accatgaagg ccaagg 16

<210> 38
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 38
ctagttcttc atctgcttct gcctcttcag cccag 35

<210> 39
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 39
aagaagtaga cgaagacgga gaagtcgg 28

<210> 40
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 40
ctagtggttc tccaggttca ccaggtcag 29

<210> 41
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 41
accaagaggt ccaagtggtc ca 22

<210> 42
<211> 41
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 42
ctagtgctac tacaactggt tcttcaccag gtccaactca g 41

<210> 43
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 43
acgatgatgt tgaccaagaa gtggtccagg ttga 34

<210> 44
<211> 32
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 44
ctagtgctaa tcattctggt aatgcttctc ag 32

<210> 45
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: LINKER

<400> 45
acgattagta agaccattac gaaga 25



SEQUENCE LISTING 



<110> Unilever PLC 
Unilever N.V. 
FRENKEN, Leo G. J. 
HOWELL, Steven 
VAN DER VAART, Jan M. 

<120> Antigen-Binding Proteins 

<130> 56159-5041 

<140> US 09/807,172 
<141> 2001-04-10 

<150> PCT/EP98/06991
<151> 1998-10-27

<150> EP 99303118.6
<151> 1999-04-22

<150> PCT/EP99/08323
<151> 1999-10-22

<160> 51

<170> PatentIn version 3.1

<210> 1 

<211> 5 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Linker sequence 

<400> 1 

Gly Thr Ser Gly Ser 
1 5 



<210> 2 

<211> 9 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Linker sequence 

<400> 2 

Ser Ser Ser Ala Ser Ala Ser Ser Ala 
1 5 



<210> 3

<211> 7

<212> PRT

<213> Artificial sequence
<220>

<223> Linker sequence

<400> 3

Gly Ser Pro Gly Ser Pro Gly
1 5



<210> 4

<211> 11

<212> PRT

<213> Artificial sequence
<220>

<223> Primer

<400> 4

Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr
1 5 10



<210> 5 

<211> 8 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 5 

Ala Asn His Ser Gly Asn Ala Ser 
1 5 



<210> 6 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 6 

aggtsmarct gcagsagtcw gg 



<210> 7

<211> 53

<212> DNA

<213> Artificial sequence
<220>

<223> Primer

<400> 7

aacagttaag cttccgcttg cggccgcgga gctggggtct tcgctgtggt gcg 53



<210> 8 

<211> 53 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 8 

aacagttaag cttccgcttg cggccgctgg ttgtggtttt ggtgtcttgg gtt 53 

<210> 9

<211> 28

<212> DNA

<213> Artificial sequence
<220>

<223> Primer

<400> 9

gggaattcca ataggtggtt agcaatcg 28

<210> 10

<211> 26

<212> DNA

<213> Artificial sequence
<220>

<223> Primer

<400> 10

gaccaacgtg gtcgcctggc aaaacg 26

<210> 11

<211> 26

<212> DNA

<213> Artificial sequence
<220>

<223> Primer

<400> 11

cgttttgcca ggcgaccacg ttggtc 26



<210> 12

<211> 30

<212> DNA

<213> Artificial sequence
<220>

<223> Primer

<400> 12

ccccaagctt acatggtctt aagttggcgt 30

<210> 13 
<211> 155 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Plasmid construct 
<400> 13 

gagctcatca cacaaacaaa caaaacaaaa tgatgctttt gcaagccttc cttttccttt 60 
tggctggttt tgcagccaaa atatctgcgc aggtgcagct gcaggagtca taatgaggga 120 
cccaggtcac cgtctcctca taatgactta agctt 155 

<210> 14 
<211> 36 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Plasmid construct 
<400> 14 

Ala His His Thr Asn Lys Gln Asn Lys Met Met Leu Leu Gln Ala Phe
1 5 10 15

Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys Ile Ser Ala Gln Val Gln
20 25 30

Leu Gln Glu Ser
35



<210> 15 

<211> 8 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Plasmid construct 






<400> 15 



Gly Thr Gln Val Thr Val Ser Ser
1 5 



<210> 16 

<211> 5 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> First 5 codons of HC-V domain

<400> 16 

Gln Val Gln Leu Gln
1 5 



<210> 17 
<211> 188 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Plasmid Construct 
<400> 17 

gagctcatca cacaaacaaa caaaacaaaa tgatgctttt gcaagccttc cttttccttt 60
tggctggttt tgcagccaaa atatctgcgc aggtgcagct gcaggagtca taatgaggga 120
cccaggtcac cgtctcctca gaacaaaaac tcatctcaga agaggatctg aattaatgac 180
ttaagctt 188



<210> 18 
<211> 36 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Plasmid construct 
<400> 18 

Ala His His Thr Asn Lys Gln Asn Lys Met Met Leu Leu Gln Ala Phe
1 5 10 15

Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys Ile Ser Ala Gln Val Gln
20 25 30

Leu Gln Glu Ser
35



<210> 19 
<211> 19 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Plasmid construct 
<400> 19 

Gly Thr Gln Val Thr Val Ser Ser Glu Gln Lys Leu Ile Ser Glu Glu
1 5 10 15

Asp Leu Asn



<210> 20 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Last 6 codons of HC-V domain 

<400> 20 

Gln Val Thr Val Ser Ser
1 5



<210> 21 

<211> 342 

<212> DNA 

<213> Artificial sequence 



<220>

<223> Plasmid Construct
<220>

<221> CDS

<222> (1)..(342)

<223>



<400> 21 

cag gtg cag ctg cag gag tca ggg gga ggc ttg gtg cag gct ggg gag 48
Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Glu
1 5 10 15

tct ctg aaa ctc tcc tgt gca gcc tct gga aac acc ttc agt ggc ggc 96
Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly Gly
20 25 30

ttc atg ggc tgg tac cgc cag gct cca ggg aag cag cgc gag ttg gtc 144
Phe Met Gly Trp Tyr Arg Gln Ala Pro Gly Lys Gln Arg Glu Leu Val
35 40 45

gca acc att aat agt aga ggt atc aca aac tat gca gac ttc gtg aag 192
Ala Thr Ile Asn Ser Arg Gly Ile Thr Asn Tyr Ala Asp Phe Val Lys
50 55 60

ggc cga ttc acc atc tcc aga gac aat gcc aag aag aca gtg tat ttg 240
Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu
65 70 75 80

gaa atg aac agc ctg gaa cct gaa gac acg gcc gtt tat tac tgt tac 288
Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys Tyr
85 90 95

act cac tac ttc aga tcc tac tgg ggt cag ggg acc cag gtc acc gtc 336
Thr His Tyr Phe Arg Ser Tyr Trp Gly Gln Gly Thr Gln Val Thr Val
100 105 110

tcc tca 342
Ser Ser



<210> 22 
<211> 114 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Plasmid Construct 
<400> 22 

Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Glu
1 5 10 15

Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly Gly
20 25 30

Phe Met Gly Trp Tyr Arg Gln Ala Pro Gly Lys Gln Arg Glu Leu Val
35 40 45

Ala Thr Ile Asn Ser Arg Gly Ile Thr Asn Tyr Ala Asp Phe Val Lys
50 55 60

Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu
65 70 75 80

Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys Tyr
85 90 95

Thr His Tyr Phe Arg Ser Tyr Trp Gly Gln Gly Thr Gln Val Thr Val
100 105 110

Ser Ser



<210> 23 
<211> 351 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Plasmid Construct 
<220> 

<221> CDS 

<222> (1)..(351)

<223>

<400> 23 

cag gtg cag ctg cag gag tca ggg gga gga ttg gtg cag gcg ggg ggc 48
Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Gly
1 5 10 15

tct ctg aga ctc tcc tgt gca gcc tct gga cgc acc ggc agt acg tat 96
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr
20 25 30

gac atg ggc tgg ttc cgc cag gct cca ggg aag gag cgt gag tct gta 144
Asp Met Gly Trp Phe Arg Gln Ala Pro Gly Lys Glu Arg Glu Ser Val
35 40 45

gca gct att aac tgg gat agt gcg cgc aca tac tat gca agc tcc gtg 192
Ala Ala Ile Asn Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val
50 55 60

agg ggc cga ttc acc atc tcc aga gac aac gcc aag aag acg gtg tat 240
Arg Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr
65 70 75 80

ctg caa atg aac agc ctg aaa cct gag gac acg gcc gtt tat acc tgt 288
Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys
85 90 95

ggc gcg ggg gaa ggt ggt act tgg gac tcc tgg ggc cag ggg acc cag 336
Gly Ala Gly Glu Gly Gly Thr Trp Asp Ser Trp Gly Gln Gly Thr Gln
100 105 110

gtc acc gtc tcc tca 351
Val Thr Val Ser Ser
115



<210> 24 
<211> 117 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Plasmid Construct 
<400> 24 

Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Gly
1 5 10 15

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr
20 25 30

Asp Met Gly Trp Phe Arg Gln Ala Pro Gly Lys Glu Arg Glu Ser Val
35 40 45

Ala Ala Ile Asn Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val
50 55 60

Arg Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr
65 70 75 80

Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys
85 90 95

Gly Ala Gly Glu Gly Gly Thr Trp Asp Ser Trp Gly Gln Gly Thr Gln
100 105 110

Val Thr Val Ser Ser
115


<210> 25
<211> 43
<212> DNA

<213> Artificial sequence
<220>

<223> Primer
<400> 25

gaattaagcg gccgcccagg tgaaactgct cgagtcwggg gga



<210> 26 
<211> 42 
<212> DNA 






<213> Artificial sequence 
<220> 

<223> Primer 
<400> 26 

ccctgggtcc agtggcagag gagtggcaga ggagtcttgt tt 



<210> 27 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 27 

caggtccagc tgcaggagtc tggg 

<210> 28 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 28 

caggtgaaac tgctcgagtc wggg 



<210> 29 

<211> 55 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Linker; Double stranded 
<220> 

<221> CDS 

<222> (2)..(40)
<223>

<400> 29

g gtc acc gtc tcc tca cag gtg cag ctg cag gag tca ctg taatgactta 50
Val Thr Val Ser Ser Gln Val Gln Leu Gln Glu Ser Leu
1 5 10

agctt 55



<210> 30 
<211> 13 
<212> PRT 






<213> Artificial sequence 



<220> 

<223> Linker; Double stranded 
<400> 30 

Val Thr Val Ser Ser Gln Val Gln Leu Gln Glu Ser Leu
1 5 10



<210> 31 

<211> 8 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 31 

Gln Val Gln Leu Gln Glu Ser Gly
1 5 



<210> 32 

<211> 672 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Plasmid Construct 
<220> 

<221> CDS 

<222> (1)..(672)

<223>

<400> 32

ctc gag tca ggg gga gga ttg gtg cag gcg ggg ggc tct ctg aga ctc 48
Leu Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Gly Ser Leu Arg Leu
1 5 10 15

tcc tgt gca gcc tct gga cgc acc ggc agt acg tat gac atg ggc tgg 96
Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr Asp Met Gly Trp
20 25 30

ttc cgc cag gct cca ggg aag gag cgt gag tct gta gca gct att aac 144
Phe Arg Gln Ala Pro Gly Lys Glu Arg Glu Ser Val Ala Ala Ile Asn
35 40 45

tgg gat agt gcg cgc aca tac tat gca agc tcc gtg agg ggc cga ttc 192
Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val Arg Gly Arg Phe
50 55 60

acc atc tcc aga gac aac gcc aag aag acg gtg tat ctg caa atg aac 240
Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu Gln Met Asn
65 70 75 80

agc ctg aaa cct gag gac acg gcc gtt tat acc tgt ggc gcg ggg gaa 288
Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys Gly Ala Gly Glu
85 90 95

ggt ggt act tgg gac tcc tgg ggc cag ggg acc cag gtc acc gtc tcc 336
Gly Gly Thr Trp Asp Ser Trp Gly Gln Gly Thr Gln Val Thr Val Ser
100 105 110

tca cag gtg cag ctg cag gag tca ggg gga ggc ttg gtg cag gct ggg 384
Ser Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly
115 120 125

gag tct ctg aaa ctc tcc tgt gca gcc tct gga aac acc ttc agt ggc 432
Glu Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly
130 135 140

ggc ttc atg ggc tgg tac cgc cag gct cca ggg aag cag cgc gag ttg 480
Gly Phe Met Gly Trp Tyr Arg Gln Ala Pro Gly Lys Gln Arg Glu Leu
145 150 155 160

gtc gca acc att aat agt aga ggt atc aca aac tat gca gac ttc gtg 528
Val Ala Thr Ile Asn Ser Arg Gly Ile Thr Asn Tyr Ala Asp Phe Val
165 170 175

aag ggc cga ttc acc atc tcc aga gac aat gcc aag aag aca gtg tat 576
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr
180 185 190

ttg gaa atg aac agc ctg gaa cct gaa gac acg gcc gtt tat tac tgt 624
Leu Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys
195 200 205

tac act cac tac ttc aga tcc tac tgg ggt cag ggg acc cag gtc acc 672
Tyr Thr His Tyr Phe Arg Ser Tyr Trp Gly Gln Gly Thr Gln Val Thr
210 215 220



<210> 33
<211> 224
<212> PRT

<213> Artificial sequence
<220>

<223> Plasmid Construct
<400> 33

Leu Glu Ser Gly Gly Gly Leu Val Gln Ala Gly Gly Ser Leu Arg Leu
1 5 10 15

Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr Asp Met Gly Trp
20 25 30

Phe Arg Gln Ala Pro Gly Lys Glu Arg Glu Ser Val Ala Ala Ile Asn
35 40 45

Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val Arg Gly Arg Phe
50 55 60

Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu Gln Met Asn
65 70 75 80

Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys Gly Ala Gly Glu
85 90 95

Gly Gly Thr Trp Asp Ser Trp Gly Gln Gly Thr Gln Val Thr Val Ser
100 105 110

Ser Gln Val Gln Leu Gln Glu Ser Gly Gly Gly Leu Val Gln Ala Gly
115 120 125

Glu Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly
130 135 140

Gly Phe Met Gly Trp Tyr Arg Gln Ala Pro Gly Lys Gln Arg Glu Leu
145 150 155 160

Val Ala Thr Ile Asn Ser Arg Gly Ile Thr Asn Tyr Ala Asp Phe Val
165 170 175

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr
180 185 190

Leu Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys
195 200 205

Tyr Thr His Tyr Phe Arg Ser Tyr Trp Gly Gln Gly Thr Gln Val Thr
210 215 220



<210> 34
<211> 48
<212> DNA

<213> Artificial sequence
<220>

<223> Synthetic Insert
<400> 34

gtcaccgtct ctagatggcc accaggtgca gctgcaggag tcaactta



<210> 35
<211> 47
<212> DNA

<213> Artificial sequence
<220>

<223> Synthetic insert
<400> 35

gcagagatct accggtggtc cacgtcgagc


<210> 36
<211> 23
<212> DNA

<213> Artificial sequence
<220>

<223> Linker
<400> 36

ctagtggtac ttccggttcc cag



<210> 37
<211> 16
<212> DNA

<213> Artificial sequence
<220>

<223> Linker
<400> 37

accatgaagg ccaagg


<210> 38
<211> 35
<212> DNA

<213> Artificial sequence
<220>

<223> Linker
<400> 38

ctagttcttc atctgcttct gcctcttcag cccag



<210> 39 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 



<223> Linker

<400> 39

aagaagtaga cgaagacgga gaagtcgg 28



<210> 40 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Linker 

<400> 40 

ctagtggttc tccaggttca ccaggtcag 29 



<210> 41

<211> 22

<212> DNA

<213> Artificial sequence
<220>

<223> Linker

<400> 41

accaagaggt ccaagtggtc ca 22

<210> 42

<211> 41

<212> DNA

<213> Artificial sequence
<220>

<223> Linker

<400> 42

ctagtgctac tacaactggt tcttcaccag gtccaactca g 41

<210> 43

<211> 34

<212> DNA

<213> Artificial sequence
<220>

<223> Linker

<400> 43

acgatgatgt tgaccaagaa gtggtccagg ttga 34



<210> 44
<211> 32
<212> DNA

<213> Artificial sequence
<220>

<223> Linker
<400> 44

ctagtgctaa tcattctggt aatgcttctc



<210> 45 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Linker 

<400> 45 

acgattagta agaccattac gaaga 

<210> 46 

<211> 8 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Primer 

<400> 46 

Gln Val Lys Leu Leu Glu Ser Gly
1 5 



<210> 47 

<211> 7 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Linker Fragment 

<400> 47 

Ser Gly Thr Ser Gly Ser Gln
1 5 



<210> 48 

<211> 11 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Linker Fragment 



<400> 48 

Ser Ser Ser Ser Ala Ser Ala Ser Ser Ala Gln
1 5 10



<210> 49
<211> 9
<212> PRT

<213> Artificial sequence
<220>

<223> Linker Fragment
<400> 49

Ser Gly Ser Pro Gly Ser Pro Gly Gln
1 5



<210> 50 
<211> 13 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Linker Fragment 
<400> 50 

Ser Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gln
1 5 10



<210> 51
<211> 10
<212> PRT

<213> Artificial sequence
<220>

<223> Linker Fragment
<400> 51

Ser Ala Asn His Ser Gly Asn Ala Ser Gln
1 5 10






